DeepLearning.AI
AI is the new electricity and will transform and improve nearly all areas of human lives.

Quick Guide & Tips

💻   Accessing Utils File and Helper Functions

In each notebook on the top menu:

1:   Click on "File"

2:   Then, click on "Open"

In the left sidebar, you will see all the notebook files for the lesson, including any helper functions used in the notebook.


🔄   Reset User Workspace

If you need to reset your workspace to its original state, follow these quick steps:

1:   Access the Menu: Look for the three-dot menu (⋮) in the top-right corner of the notebook toolbar.

2:   Restore Original Version: Click on "Restore Original Version" from the dropdown menu.

For more detailed instructions, please visit our Reset Workspace Guide.


💻   Downloading Notebooks

In each notebook on the top menu:

1:   Click on "File"

2:   Then, click on "Download as"

3:   Then, click on "Notebook (.ipynb)"


💻   Uploading Your Files

After following the steps in the "Accessing Utils File and Helper Functions" section ("File" => "Open"), click the "Upload" button to upload your files.


📗   See Your Progress

Once you enroll in this course—or any other short course on the DeepLearning.AI platform—and open it, you can click on 'My Learning' at the top right corner of the desktop view. There, you will be able to see all the short courses you have enrolled in and your progress in each one.

Additionally, your progress in each short course is displayed at the bottom-left corner of the learning page for each course (desktop view).


📱   Features to Use

🎞   Adjust Video Speed: Click on the gear icon (⚙) on the video and then from the Speed option, choose your desired video speed.

🗣   Captions (English and Spanish): Click on the gear icon (⚙) on the video and then from the Captions option, choose to see the captions either in English or Spanish.

🔅   Video Quality: If you do not have access to high-speed internet, click on the gear icon (⚙) on the video and then from Quality, choose the quality that works best for your internet speed.

🖥   Picture in Picture (PiP): This feature allows you to continue watching the video when you switch to another browser tab or window. Click on the small rectangle shape on the video to go to PiP mode.

√   Hide and Unhide Lesson Navigation Menu: If you do not have a large screen, you may click on the small hamburger icon beside the title of the course to hide the left-side navigation menu. You can then unhide it by clicking on the same icon again.


🧑   Efficient Learning Tips

The following tips can help you have an efficient learning experience with this short course and other courses.

🧑   Create a Dedicated Study Space: Establish a quiet, organized workspace free from distractions. A dedicated learning environment can significantly improve concentration and overall learning efficiency.

📅   Develop a Consistent Learning Schedule: Consistency is key to learning. Set aside specific times in your day for study and make it a routine. Consistent study times help build a habit and improve information retention.

Tip: Set a recurring event and reminder in your calendar, with clear action items, to get regular notifications about your study plans and goals.

☕   Take Regular Breaks: Include short breaks in your study sessions. The Pomodoro Technique, which involves studying for 25 minutes followed by a 5-minute break, can be particularly effective.

💬   Engage with the Community: Participate in forums, discussions, and group activities. Engaging with peers can provide additional insights, create a sense of community, and make learning more enjoyable.

✍   Practice Active Learning: Don't just read or run notebooks or watch the material. Engage actively by taking notes, summarizing what you learn, teaching the concept to someone else, or applying the knowledge in your practical projects.


📚   Enroll in Other Short Courses

Keep learning by enrolling in other short courses. We add new short courses regularly. Visit the DeepLearning.AI Short Courses page to see our latest courses and begin learning new topics.

👉👉 🔗 DeepLearning.AI – All Short Courses


🙂   Let Us Know What You Think

Your feedback helps us know what you liked and didn't like about the course. We read all your feedback and use it to improve this course and future courses. Please submit your feedback by clicking on the "Course Feedback" option at the bottom of the lessons list menu (desktop view).

Also, you are more than welcome to join our community 👉👉 🔗 DeepLearning.AI Forum



Voice for AI Agents and Applications

Welcome to Voice for AI Agents and Applications, built in partnership with Vocal Bridge, an AI Fund portfolio company, and taught by CEO Ashwyn Sharma. In this course, you'll learn how to build agents with voice-interactive user interfaces. You'll add voice to your existing agents, as well as to agents that use voice as a function-call tool. You'll also apply voice evals to your agents, with detailed feedback on quality and accuracy, to improve them.

The vast majority of people on this planet find speaking and listening much easier than writing and reading. As voice UIs become more reliable, they'll open up many new applications for many people. For example, in The Batch newsletter, I wrote about building a simple cat-themed math quiz application for my seven-year-old daughter. She's enjoyed using the keyboard to play this game, but using Vocal Bridge, I was able to add a voice UI pretty quickly. So I can now quiz her verbally in a friendly way, and she can respond verbally as well. This really changes how the experience feels.

Voice applications have historically faced a trade-off between latency and intelligence. One option is to use native voice-in, voice-out (voice-to-voice) real-time models, which are fast but less reliable and harder to control. The other approach is a pipeline that takes audio input, runs speech-to-text, has an LLM execute an agentic workflow, and finally uses text-to-speech to read out the response. That's more reliable and more controllable, but it adds latency, which is a problem for real-time conversations.

In this course, you'll learn how to build voice applications that are both fast and reliable. Vocal Bridge, which you'll learn to use, relies on a custom architecture with a fast foreground agent for real-time conversation, as well as a background agent for reasoning workflows, guardrails, and tools. This gives your voice agents the best combination of both low latency and high intelligence.

Thanks, Andrew. This course teaches three integration patterns that meet you where you are.

The first pattern is voice embedded in applications. Think of a game or a productivity tool where users can use voice commands but can also click and interact with the UI directly. The voice agent needs to trigger UI changes when the user speaks, and it needs to know when the user clicks on something. That bidirectional awareness is what makes the experience feel natural.

The second pattern is voice for existing agents. You've already built a Claude-, GPT-, or LangChain-powered agent with custom logic, and you don't want to rewrite it. Vocal Bridge sits as a thin layer in front of your agent. It handles voice-to-intent conversion while deciding when to delegate a question to your agent and when to handle conversational pleasantries itself.

The third pattern is voice as a tool your LLM can call. Imagine a recruiting agent coordinating interviews. A candidate texts their availability, and the agent responds in text to confirm and then also calls to coordinate the interview date that works best for the candidate. Or imagine a brainstorming agent working with you via chat. It says, "This would be faster to talk through," and opens a live voice session. So the agent chooses the right modality for the moment.

We will also have Scott Johnston, former CEO of Docker and board member of Vocal Bridge, share with us the recent developments in bringing AI voice to production, along with the landscape of voice agents and applications.

Many people have worked to create this course. I'd like to thank Eli Chen and Rakesh Utekar from AI Fund, and Aditi Dhar and Jitesh Gupta from Vocal Bridge. From DeepLearning.AI, Brendan Brown and Esmaeil Gargari also contributed to this course.

In the first lesson, you will learn about the traditional voice agent stack in detail, so you understand what's being abstracted away by Vocal Bridge. Then you will see live demos of all three integration patterns we discussed. By the end of the lesson, you will know which pattern fits your use case and be ready to start building in the next lessons.

This sounds great. Let's get started.
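The foreground/background split and the delegate-or-handle decision described in the introduction can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Vocal Bridge's API or implementation: `PLEASANTRIES`, `background_agent`, `handle_turn`, and the `speak` callback are hypothetical names, and the `asyncio.sleep` call stands in for real model and tool latency.

```python
import asyncio

# All names here are hypothetical stand-ins for illustration only.
PLEASANTRIES = {"hi", "hello", "thanks", "thank you", "bye"}


async def background_agent(utterance: str) -> str:
    """Simulates a slower reasoning workflow (LLM call, tools, guardrails)."""
    await asyncio.sleep(0.2)  # stand-in for model and tool latency
    return f"Here is what I found about '{utterance}'."


async def handle_turn(utterance: str, speak) -> None:
    """Foreground agent: reply immediately, delegate only when needed."""
    text = utterance.lower().strip(" .!?")
    if text in PLEASANTRIES:
        # Small talk is handled in the foreground with no delegation.
        speak("Happy to help!")
        return
    # Start the slow work first, then fill the silence while it runs.
    task = asyncio.create_task(background_agent(utterance))
    speak("Let me check on that.")
    speak(await task)


if __name__ == "__main__":
    asyncio.run(handle_turn("Thanks!", print))
    asyncio.run(handle_turn("When is my interview?", print))
```

The point of the design is that the user hears something right away in both branches. Only the second branch pays the background latency, and the filler phrase covers part of it.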
Course Syllabus
  • Introduction (Video, 4m)
  • Overview of Voice UI (Video, 9m)
  • Voice in Your App (Video with Code Example, 10m)
  • Voice for Your Agent (Video with Code Example, 12m)
  • Voice as a Tool (Video with Code Example, 9m)
  • Voice AI Evals (Video with Code Example, 10m)
  • Voice Agents in Production (Video, 8m)
  • Conclusion (Video, 1m)
  • Glossary (Reading, 10m)
  • (Optional) Create a Vocal Bridge Account (Code Example, 1m)
  • Quiz (Graded Quiz, 10m)